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(57) The present invention has an architecture 
capable of eliminating a. necessity for previously desig- 
nating in a document, when, reading the document, 
attributes for reading. Focusing on utilizing the docu- 
ment with attributes, a document reading system ana- 
lyzes contents of the attributes and reads texts in the 
document with a voice synthesizing module. The 
attributes are set irrespective of reading conditions. A 
basic reading condition setting module 3 sets the read- 
ing conditions for the entire document. An individual 
reading condition setting module 5 sets the reading con- 
dition for every attribute. A selective reading module 15, 
when reading the document, in principle, reads the text 
with reference to the basic reading condition set by the 
basic reading condition setting module, and selectively 
reads the text having the individual reading condition 
with reference to the individual reading condition given a 
higher priority than the basic reading condition. 
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Description 

Technical Filed 

[0001] The present invention relates to a document 
reading system for reading aloud texts in a document 
inputted to a computer, and to a read control method. 

Background Arts 

[0002] A system disclosed in. e.g.. Japanese Patent 
Application Laying-Open Publication No.8-272388 is 
known as a document reading system in the prior art. 
[0003] This document reading system includes, as 
a voice synthesizing module for vocally synthesizing 
. text data in kanji characters mixed with kana characters, 
an ennbedding module for embedding control data into 
text data, and an output module for synthesizing and 
outputting the voices based on the text data with a voice 
quality corresponding to the control data. 
[0004] This type of system, however, needs to 
embed the control data into the text data beforehand in 
order to read a certain portion with some quality of 
voice. 

[0005] Accordingly, if a scheme is that, in the case 
of reading the HTML texts on the Internet, some propor- 
tion of texts are read in a male voice and the rest of 
those texts are read in a female voice, it is required that 
the control data be written into the desired portion in the 
document each time, which is considered laborious. 
[0006] The present invention, which was devised 
under such circumstances, aims at providing a technol- 
ogy capable of eliminating a necessity for embedding 
the control data for giving a reading condition into the 
document each time. 

Disclosure of Invention 

[0007] . According to the present invention, focuss- 
ing on such a point that an HTML (Hyper Text Mark-up 
Language) text and an RTF (Rich Text File) text previ- 
ously contain attribute data (which will hereinafter be 
called tags as the case may be) for defining a decora- 
tion condition, etc. with respect to the text in the docu- 
ment irrespective of reading in voice, the tag is utilized 
as control data for reading. 

[0008] A document reading system for analyzing 
contents of a document with attributes and reading texts 
in the document by use of a voice synthesizing module, 
comprises a basic reading condition setting module for. 
the attributes being set irrespective of reading condi- 
tions, setting the reading conditions for the entire docu- 
ment, an individual reading condition setting module for 
setting the reading condition for every attribute, and a 
selective reading module for, when reading the docu- 
ment, in principle, reading the text with reference to the 
basic reading condition set by the basic reading condi- 
. tion setting module, and selectively reading the text hav- 



ing the individual reading condition with reference to the 
individual reading condition given a higher priority than 
the basic reading condition. 

[0009] Herein, the reading condition is any one of at 
5 least a quality (e.g.. a voice pitch, a male or female 
voice), a volume (loudness of voice) and an accent 
(voice intonations and dialects) of a reading voice, and 
a selection of whether to read or not. For example, sup- 
posing that there is a text such as [<h2>introduction of 
10 this homepage</h2>] in the HTML document, tags 
<h2></h2> specify a size of characters interposed 
between these tags when displayed. Then, in relation to 
these tags <h2></h2>, the characters therebetween is 
read in a male voice. 
15 [0010] In particular, it is preferable that the reading 
condition set for every attribute by the individual reading 
condition setting module be related to an essential 
meaning of the attribute, and an essential meaning des- 
ignated by the attribute can be associated from the . 
20 reading voice. 

[0011] To be specific, the tags <h2></h2> indicate 
the size of the characters, h2 represents a display larger 
than h3 but smaller than hi. Then, a docunrient speci- 
fied by h2 is read louder than by hS but fainter than by 
25 hi, vyhereby the reading can be done in a mode based 
on the HTML protocol and the document can be visually 
associated simply by hearing the reading voice. 
[0012] The document re^ading system may further 
comprise a reading condition storage module for storing 
30 the reading conditions. 

[0013] In the document reading system according 
to the present invention, the basic reading.condition set- 
ting module sets the reading conditions for the entire 
document, and subsequently the individual reading con- 
as dition setting niodule sets the reading condition for 
every attribute. 

[0014] When reading aloud the document, the text 
with no particular designation is read referring in princi- 
ple to the basic reading condition set by the basic read- 

40 ing condition setting module. 

[0015] Various items of data can be known from the 
tags in the document. In the case of the HTML, the var- 
ious tags of a title of the page, a heading thereof, con- 
tent texts, links, mail addresses and so on. are 

45 described in the document, and, on a screen, character 
sizes and colors are selectively set corresponding to the 
tags. In the prior art reading system, however, all the 
portions are read in the same voice, with the result that 
these items of data are lost. According to the present 

50 invention, the tag data can be recognized in voice by . 
setting the reading conditions con-esp ending to the 
essential data contents of the tags. 
[001 6] Note that the attribute given to the document 
serves to control, e;g., displaying the document- Fur- 

55 ther. when the document is an HTML document, the 
attribute given to the document is tag data. . 
[0017] A document reading method of reading texts 
in a document by use of a voice synthesizing module 
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according to the present invention, connp rises a step of 
judging an attribute, in the docunnent, for controlling dis- 
play of the docunnent, and a step of changing a condi- 
tion of reading the text of which the display is controlled 
by the attribute on the basis of a result of the judgement. 5 
[0018] In this document reading method, the read- 
ing condition may be changed corresponding to a cate-. 
gory of the attribute. 

[0019] According to the present invention, there is 
also provided a readable-by-computer recording 10 
medium recorded with a program for making a compu- 
ter, for reading texts in a document* by use of a voice 
synthesizing module, execute a step of judging an 
attribute, in the document, for controlling display of the 
document, and a step of changing a condition of reading is 
the text of which the display is controlled by the attribute 
on the basis of a result of the judgement. 
[0020] . The . readable-by-computer recording 
medium may be recorded with a program for making the 
. computer further execute a step of changing the reading 20 
condition corresponding to a category of the attribute. 

Brief Description of the Drawings 



[0021] 



25 



FIG. 1 is a block diagram showing an example of an 
architecture of the present invention; 
FIG. 2 is a chart showing an example of setting 
reading conditions (fixed values); 30 
FIG. 3 is a chart showing an example of setting the 
reading conditions (relative designations to the 
basic settings); 

FIG. 4 is a view showing one example of a basic 
reading condition setting module; 35 
FIG. 5 is a view showing an individual reading con- 
dition setting module and an individual reading 
ON/OFF specifying module; . 
FIG. 6 is a flowchart showing reading steps; 
FIG. 7 is a view showing an example of displaying 40 
an HTML document by way of one example of a 
reading target; 

FIG. 8 is a view showing the reading target in FIG. 
7 as source data; 

FIG. 9 is a chart showing a recommendation setting 45 
pattern of reading conditions; 

FIG. 10 is view showing reading target text data; 
and 

FIG. 11 is a view showing reading assist data. 

50 

Best Mode for Carrying out the Invention 

[0022] . FIG. 1 shows an architecture in one embod- 
iment of the present invention. . 
[0023] According to a program-based architecture ss 
of the present invention, a CPU of a computer executes 
this program, thereby actualizing function performing 
modules on the CPU as shown in FIG. 1 . ' 



[0024] As illustrated in FIG. 1. the document read- 
ing system includes a document management module 1 
for managing documerit information read by the compu- 
ter through a recording medium such as a floppy disk 
and a CD-ROM and media like Internet, etc.. 
[0025] This document management module 1 is 
defined as software for reading and downloading docu- 
ments such as HTML texts, RFT texts, etc. 
[0026] The document reading system further 
includes an attribute analyzing module 2 for analyzing 
the document read by the document management mod- . 
ule 1 of the computer and thus detecting a [tag] as. an 
attribute thereof. Then, a reading target information 
management module 3 manages, as reading target 
information, the document read by the document man- 
agement module 1 and the tag analyzed by the attribute 
analyzing module 2. 

[0027] On the other hand; the document reading 
system has a basic reading condition setting module 4 
for setting reading conditions for the whole document, 
. an individual reading condition setting module 5 for set- 
ting the reading condition per attribute and an individual 
reading ON/OFF specifying module 6 for setting 
whether a text document with an attribute specially des- 
ignated as an individual reading condition, though it is a 
concept embraced by the individual reading condition 
setting module 5, should be [read aloud (ON)] or [not 
read aloud (OFF)], of which settings are done through 
an input device such as a keyboard, etc.. 
[0028] The document reading system further 
includes a basic reading condition management module 
11, an individual reading condition management mod- 
ule 12 and an individual reading ON/OFF information 
management module 13, which manage the respective 
conditions set by the basic reading condition setting 
module 4. the individual reading condition setting mod- 
ule 5 and the individual reading ON/OFF specifying 
module 6, and write or read the respective conditions to 
or from a hard disk serving as a reading condition stor- 
age module 10 through a basic reading condition I/O 
module 7, an individual reading condition I/O module. 8 
and an individual reading ON/OFF. information I/O mod- 
ule 9. 

[0029] Then, when reading the document, the basic 
reading condition management module 11, the individ- 
ual reading condition management module 12 and the 
individual reading ON/OFF information management 
module 13, read the respective conditions from the hard 
disk serving as the reading condition storage module 10 
through the basic reading condition I/O module 7, the 
individual reading condition I/O module 8 and the indi- 
vidual reading ON/OFF information I/O module 9, and 
transmit data about those conditions to a voice synthe- 
sizing module 14. 

[0030] The voice synthesizing module 1 4 includes a 
selective reading module 15 for reading aloud, with 
reading targets being set to the document data and the 
[tag] defined as an attribute which are mariaged by the. 



5 



EP 1 077 403 A1 



6 



reading target information management module 3. at 
first the text with reference to the basic reading condi- 
tion set by the basic reading condition setting module 4, 
and selectively reading the text having the individual 
reading condition with reference to the individual read- 
ing condition given a higher priority than the basic read- 
ing condition. 

[0031] Note that a voice synthesizing mode used 
herein upon reading has hitherto been known and is not 
therefore explained in particular. 

[0032] Herein, FIG. 2 shows an example where the 
reading conditions are set in fixed values. Referring to 
FIG. 2, the reading conditions are voice loudness, a 
voice pitch, a voice type (male voice and female voice) 
and a voice intonation. 

[0033] Then, the basic reading condition setting 
module 4 sets, as basic conditions, the loudness, the 
pitch, the type (male voice and female voice) and the 
intonation of the voice as shown In FIG. 2. Further, the 
individual reading condition setting module 5 sets the 
conditions shown in FIG. 2 with respect to tags .1 — 4, 
[0034] FIG. 3 is a chart showing a case where the 
fixed values shown in FIG. 2 are designated relatively to 
the basic settings. Herein, FIG. 3 is a representation 
shown relatively to the basic set values as standard val- 
ues. 

[0035] The settings by the basic reading condition 
setting module 4, the individual reading condition set- 
ting module 5 and the individual reading ON/OFF spec- 
ifying module 6 are inputted specifically on an input 
screen as shown in FIGs. 4 and 5. . . 
[0036] FIG. 4 shows an example of the setting by 
the basic reading condition setting module 4.. FIG. 5 
shows the settings by the individual reading condition 
setting module 5 and the individual reading ON/OFF 
specifying module 6. Herein, names of tags of the 
HTML documents are indicated in a selective reading 
target column Rl, and the name of the actual tag of the . 
name indicated in the column Rl is shown in a column 
R2 under the column Rl. On the right side of the col- 
umns R1 and R2, a check box R3 for designating the 
read is provided as the individual reading ON/OFF 
specifying module 6 for setting whether to individually 
read with respect to the reading target tag. An individual 
setting check box R4 for setting the voice loudness, the 
voice pitch and the voice type is provided as the individ- 
ual reading condition setting module 5 under the check 
box R3. The individual setting check box R4 is activated 
when [Reading] is set in the check box R3. 
[0037] With respect to the settings described 
above, as shown in FIG. 2, the specific values may be 
set as the tag data, and the settings relative to the basic 
settings may also be taken. In the case shown in FIG. 2, 
an advantage is that the set value per tag is retained 
without being influenced by the basic settings. In the 
case shown in FIG. 3, the representation can be given 
by the relative designation to the basic settings, and 
hence it is feasible to make . a sensuous designation 



such as [Lefs try to read in a louder and higher-pitch 
voice than in the normal portion] without indicating the 
specific numerical values. By use of those pieces of 
information, the attribute analyzing module 2 analyzes 

5 the tags with respect to the document data obtained by 
the document management module 1 in FIG. 1, and 
transfers a result of the analysis as reading target data 
to the voice synthesizing module 14. 
[0038] ; On the other hand, based on the basic read- 

10 ing voice setting previously designated^ and the reading 
voice setting per tag, the voice synthesizing module 1 4 
reads the given reading target data aloud by use of the 
designated voice attribute. 

[0039] This reading steps will be explained referring 
15 to a flowchart in FIG. 6. 

[0040] This is an example of reading the HTML doc- 
ument shown in FIGs, 7 and 8. FIG. 7 shows an exam- 
ple where the HTML document is displayed on browser. 
FIG. 8 shows source data thereof. In this example, the 
20 reading voice setting per tag of the HTML has already 
. been set, and it is herein assumed that a recommenda- 
tion pattern shown in FIG. 9 be set. This recommenda- 
tion pattern is a pattern preset as a standard modei in 
the reading condition storage module 10. . 
25 - [0041] To start with, in step 101. the document 
management module 1 downloads the source data 
shown in FIG. 8 and reads the data as an HTML file. 
Next, the HTML attribute analyzing rnodule 2 executes 
an analysis on a character-basis from the head of the 
30 data of the HTML file. A character or symbol interposed 
between and in the data is interpreted as a tag. 
If not categorized as the selective reading target tag. 
that may be ignored, and, whereas if being the selective 
reading target tag. the reading target information man- 
as agement moduje 3 stores the memory with the reading 
target text.shown in FIG. 10 (step 103). Then, the read- 
ing target information management module 3 stores the 
memory with reading assist data shown in FIG. i 1 (step 
104). Herein, the selective reading assist data are set- 
40 ting data of a position and a voice in the reading target 
text data. 

[0042] In the case shown in FIG. 8. the processing 
is done as follows: 

45 (1 ) Data [male voice, loudness = 3. pitch = 3] written 

in a column [other tags] in a voice setting table 
(FIG. 9), are registered as a voice initial setting, 
Reading is carried out in this mode for the first time. 
(2) The first line is processed. A tag <html> is out of 

50 the reading target and is therefore ignored. 

' (3) The second line is processed. A tag <head> is 
out of the reading tai'get and is therefore ignored. A 
next tag <title> is defined as [reading OFF] in the 
voice setting table (FIG. 9), and hence the. reading 

55 is skipped over to a corresponding tag </title>. A 
next tag </head> is also out of the. reading target 
and is therefore ignored. 

(4) The third line is processed. A tag <body> is out 
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of the reading target and is therefore ignored. 

(5) The fourth line is processed. A tag <br> is out of 
the reading target and is therefore ignored. A next 
sentence is additionally registered as a reading tar- 
get to [reading target text data]. 

(6) The fifth line is processed. The sentence is addi- 
tionally registered as a reading target, 

(7) The sixth line is processed. A tag <comnnent> is . 
set as [reading OFF] in the voice setting table, and 
therefore the reading is skipped over to a corre- 
sponding tag </comment>. 

(8) The seventh line is processed. The reading is 
skipped over two tags <br> and <center>. With a 
next tag <font size = 2>, the voice setting is 
changed to (male voice, loudness = 2. pitch = 3), 

. which is stored in [reading assist data]. Further, a 
text delinnited by a tag </font> is additionally regis- 
tered as a reading target. ' 

(9) The eighth line is likewise processed. The voice 
setting is changed to (male voice, loudness = 5, 
pitch = 4) corresponding to <font size = 6>, which is 
stored in [reading assist data). Further, the text 
delimited by the tag </font> is additionally regis- 
tered as a reading target. . 

(10) Next, the voice setting is initialized to (male 
voice, loudness = 3. pitch = 3), and a text is also 
registered. 

(11) The processing of a ninth line is that only a text 
is added. The tag <br> is ignored. 

(12) The processing of a tenth line is that a text 3o 
down to [For getting it,] is registered in the reading 
target text data. Next, the voice setting is initialized 
corresponding to <a href>, and a subsequent text is 
registered. 

(13) The processing of an eleventh line is that only 35 
a text is added. The tag <br> is ignored. 

(14) Tvyelfth and thirteenth lines are processed so 
that tags are ignored, and the processing comes to 
an end. 

40 

[0043] As a result, the data are stored in [reading 
target text data] and [reading assist data]. The voice 
synthesizing module interprets those pieces of data and 
synthesizes the voices. 

[0044] As discussed above, the selective reading 45 
module 1 5 is capable of selectively reading in an elabo- 
rate manner by use of the tags as the elements of doc- 
ument. For instance, if [reading] is designated to only a 
heading of HTML, it follows that only the portions con- 
sidered important are extracted and read aloud in gen- so 
eral. Further, it is also feasible to designate in such a 
mode that the portion with a larger font is read louder, 
and the portion with a smaller font is read fainter. There- 
fore, even a nuance of se^ntence that can not be con- 
veyed simply by reading with no intonation can be 55 
expressed by reading in the voice synthesization with- 
out looking at the screen. 



(Other Examples) 

[0045] The attribute analyzing module 2 analyzes 
the tags in the document, thereby recognizing various 
5 pieces of information. In the case of the HTML, the tags 
of the title of the page, the heading thereof, the content 
texts, the links, the mail addresses and so on, are 
described in the document, and, on the screen, the 
character sizes and colors are selectively set corre- 
10 spending to the tags. 

. [0046] Then, the reading conditions corresponding 
to those pieces of data are set corresponding to mean- 
ings and contents of the tags. These settings are stored 
beforehand in a way of corresponding to the tags in an . 
15 unillustrated table. With this contrivance, the same tag 
is read aloud always in the same voice, and the reading 
voice may be set louder or fainter corresponding to the 
character size with reference to the table each time the 
tag is analyzed. It is therefore possible to set the reading . 
20 condition corresponding to the essential data content of 
the tag and to recognize the tag data in voice. 
[0047] According to the present invention, the doc- . 
urinent can be selectively read aloud by. utilizing the 
attribute data as they are. which fiave been set before- 
25 hand in the document, and there is no. necessity for set- 
ting each piece of attribute data for the selective reading 
in the document, 

[0048] Then, if the reading condition set per 
attribute by the individual reading condition setting mod- 
ule 5 is related to the essential meaning of the attribute, 
the essential meaning designated by the attribute can 
be associated from the voice uttered by reading, and 
the contents of the document that have been read in 
voice can be visually comprehended. 

Industrial Applicabilitv 

[0049] The present invention is applicable to infor- 
mation processors such as a personal computer, a word 
processor and so on that are capable of dealing with the 
text data, and to an Internet terminal capable of dealing 
with the HTML document. 

Claims 

1. A documisnt reading system for analyzing contents 
of a document with attributes and reading texts in 
the document by use of a voice synthesizing mod- 
ule, comprising: 

a basic reading condition setting module for, 
the attributes, being set irrespective of reading 
conditions, setting the reading conditions for 
the entire document; 

an individual reading condition setting. module 
for setting the reading condition for every 
attribute; and 

a selective reading module for^ when reading 
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the document, in principle, reading the text with 
reference to the basic reading condition set by 
said basic reading condition setting module, 
and selectively reading the text having the indi- 
vidual reading condition with reference to the 5 
individual reading condition given a higher pri- 
ority than the basic reading condition. 

2. A document reading system accorcling to claim 1,. 
wherein the reading condition is any one of at least 10^ 
a quality, a volume and an accent of a reading 
voice, and a selection of whether to read or not. 

3. A document reading system according to claim 1 , 
wherein the reading condition set for every attribute 75 
by said individual reading condition setting module 

is related. to an essential meaning of the attribute, 
and an essential meaning designated by the 
. attribute can be associated from the reading voice. 

20 

4. A document reading system according to claim 1. 
. further comprising a reading condition storage 

module for storing the reading conditions. 

5. A document reading system according to claim 1, 25 
wherein the attribute serves to controj displaying 
the document 

6. A document reading system according to claim 1, 
wherein the document is an HTML document, and 30 
the attribute is tag data 

7. A document reading control method of i-eading 
texts in a document by use of a voice synthesizing 
module, comprising: 35 

judging an attribute, in . the document, for con- 
trolling display of the document; and 
changing a condition , of reading the text of 
which the display is controlled by the attribute 40 
on the basis of a result of the judgement 

8. A . document reading control method according to 
claim 7, wherein the reading condition is changed 
corresponding to a category of the attribute. 45 

9. A storage medium readable by a computer, tangibly 
embodying a program of instructions executable by . 
the computer to perform method steps for display- 
ing content data, for reading texts in a document by so 
use of a voice synthesizing module, the method, 
steps comprising: 

judging an attribute, in the document, for con- 
trolling display of the document; and 55 
changing a condition of reading the text of 
which the display is controlled by the attribute 
on the basis of a result of the judgement 



10. A storage medium readable by a computer, accord- 
ing to claim 9, for making a computer further com- 
prising: 

a step of changing the reading condition corre- 
sponding to a category of the attribute. 
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FIG. 6 
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FIG. 9 
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FIG. 10 
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